LAYER BASED COMPRESSION OF DIGITAL IMAGES
Field of Invention 

The present invention relates generally to a method and apparatus for 
compression of digital images, and in particular, for layer based image compression. 

Background 

In large organizations where huge numbers of documents are handled, there is a
demand for compression of digital images. For example, banks store digital formats of
checks. More than 200 million checks are written in the USA every day, and some banks
store the digital format of the checks on line for several months. Additionally,
government offices store great quantities of official forms and documents. In order to
decrease the size of the digital information, many of the documents are compressed.
Compression of digital images decreases the required storage capacity and reduces
the transmission times of the images.

In some instances, images may comprise different types of information, such as
text and graphics. As such, the digital image may either be stored as a single image, or
may be separated into several images, each image compressed and stored separately.
There are various methods known in the art for image separation and compression,
such as image segmentation and JPEG, respectively.

If the image is saved as a single image, it typically requires a lot of storage
space. It is therefore desirable to save the original image as two or more separate
images. However, techniques to separate text from graphics are less than optimal.

Thus, there exists a need for improved image separation and compression tools.

Summary

There is therefore provided, in accordance with an embodiment of the present 
invention, a method for image separation of an image, wherein the image includes 
pixels. The method includes the steps of identifying kernels, whereby characteristics of 
the kernels are reflected by at least one of the operators P(x-w, y) - P(x,y) > t AND
P(x+w, y) - P(x,y) > t; or P(x, y-w) - P(x,y) > t AND P(x, y+w) - P(x,y) > t; or P(x+d,
y+d) - P(x,y) > t AND P(x-d, y-d) - P(x,y) > t; or P(x-d, y+d) - P(x,y) > t AND P(x+d, y-d)
- P(x,y) > t, wherein the kernels include at least some of a first group of the pixels. The
kernels are associated with a first layer, and pixels that are not associated with the first
layer are classified as a second layer. The first layer may be text or graphics and the
second layer may be a background.

Alternatively, the first layer is darker than the second layer, or conversely, the
first layer is lighter than the second layer. In some embodiments, identifying kernels
includes performing a binarization technique or performing text binarization. Identifying
kernels may also include examining grey characteristics of pixels in an expansion of
the kernels, wherein the expansion is less than or equal to 3 times w, wherein w is a
typical stroke width of the image.

In some embodiments, the method further includes storing the first layer, and 
compressing the first layer with a high resolution compression technique. The method 
may further include compressing the second layer with a highly lossy compression
method.

There is therefore provided, in accordance with an embodiment of the present
invention, a method of compressing an image having pixels. The method includes the
steps of identifying four grey levels, wherein each of the four grey levels is mapped by
an associated two bits, identifying for each pixel an associated grey level, wherein
the associated grey level is one of the four grey levels, storing per pixel an associated
pixel location and the associated grey level, and storing the mapping of each of the four
grey levels. The pixel location may be a mask image.

The method may also include dividing the image into tiles, and per tile, 
performing the steps of identifying, identifying, storing and storing. 

There is therefore provided, in accordance with an embodiment of the present
invention, a method of compressing an image having pixels. The method includes
identifying Y grey levels, wherein Y is greater than 2, wherein each of the Y grey levels
is mapped by the root of Y associated bits and wherein the number of bits is log2 Y. The
method further includes identifying for each pixel an associated grey level, wherein
the associated grey level is one of the Y grey levels, storing per pixel an associated
pixel location and the associated grey level, and storing the mapping of each of the Y grey
levels. In some embodiments Y is 2 and the number of bits is 1.

There is therefore provided, in accordance with an embodiment of the present 
invention, a method including the steps of identifying first and second image content; 
and separately compressing the first and second image content. The first and second 
content may include image foreground and image background. A higher quality
compression technique may be used to compress the foreground content, as compared
to the compression technique used to compress the background content.

There is therefore provided, in accordance with an additional embodiment of the 
present invention, a system for separating an image, the system including a scanner, a 
processor and a memory. The scanner may create a digital image. The processor may 
separate the digital image into a first and second layer, and compress the first layer with 
a first compression technique and the second layer with a second compression 
technique. The memory may store the compressed first and second layers. 

The processor may include a means for identifying kernels reflected by at least 
one of the operators selected from the group consisting of: P(x-w,y) - P(x,y) >t AND 
P(x+w, y) - P(x,y) > t ; and P(x, y-w) - P(x,y) > t AND P(x, y+w) - P(x,y) > t ; and P(x+d, 
y+d) - P(x,y) > t AND P(x-d, y-d) - P(x,y) > t ; and P(x-d, y+d) - P(x,y) > t AND P(x+d, 
y-d) - P(x,y) > t; and associating the kernels with the first layer. The means for 
identifying may be a text binarization tool. The processor may also include a 
compression means for compressing the first layer with a high resolution compression 
technique. 

The processor may include a compression means for compressing the second
layer with a highly lossy compression method. The processor may include restoration
means for creating a restored digital image from the compressed first and second layers.

The compressed first layer may include a binary mask of the foreground layer,
compressed grey level foreground layer data, and quantization grey levels. The
compressed grey level foreground layer data may be stored in a two bit buffer or a one bit
buffer. The quantization grey levels may include four levels or two levels.

There is therefore provided, in accordance with an additional embodiment of the
present invention, a computer software product, including a computer-readable medium
in which program instructions are stored, which instructions, when read by the computer,
cause the computer to separate an image into a first and second layer, and to compress
the first layer with a first compression technique and the second layer with a second
compression technique.

Brief Description of Figures

The present invention will be understood and appreciated more fully from the 
following detailed description taken in conjunction with the appended drawings in which: 

Fig. 1 is an illustration of a multi-layer compressed grey image; 

Figs. 2A and 2B are illustrations of a foreground and a background layer, 
respectively, constructed and operative in accordance with a preferred embodiment of 
the present invention;

Fig. 3 is a flow chart of a layer separation method, operative in accordance with 
a preferred embodiment of the present invention; 

Fig. 4 is an illustration of a tile, constructed and operative in accordance with a 
preferred embodiment of the present invention; 

Fig. 5 is a histogram of a foreground image; 

Fig. 6 is an illustration of an image restored from compression methods 
operative in accordance with a preferred embodiment of the present invention; and 

Fig. 7 is a block diagram of an image separation and storage system constructed 
and operative in accordance with a preferred embodiment of the present invention. 

Detailed Description

One embodiment of the present invention is a method and apparatus for layer 
based compression of digital images. One element of the present invention describes 
separating an image into two layers, a foreground and a background. In some 
embodiments, the foreground may be a text layer. 

The present inventors discovered that not all the data in an image are of equal 
importance. Some of the data may be more important than other data. As an example, 
in a check, the foreground or text (date, amount, signature, etc.) may be more important 
than the background of the check. Hence, when compressing and storing the check 
image, and later restoring the check image from the compressed data, it may be more 
crucial for high quality restoration of the text, than for exact restoration of the 
background. 

One element of the present invention therefore describes a method and apparatus
for compressing the foreground and the background separately, typically with the
foreground stored with a higher quality compression method than the background. In
one embodiment of the present invention, the foreground may be compressed with a
nearly lossless compression method, and the background may be compressed with a
highly lossy compression method.

In particular, the foreground usually requires high quality, high
resolution shape information. However, the value of greyness is not crucial for the good
quality of the restored image. Hence, coarse quantizers can be applied in order to save
space. For the background, the opposite approach may be much better; there, low
resolution may be tolerated. However, since fidelity of greyness is of importance, fine
quantizers must be used. Moreover, since foreground and background tend to be
different in nature, it may happen that even for a given compression ratio, different
algorithms should be used for each of these two image segments.

An additional element of the present invention describes a method for 
compressing the foreground of an image. 

In an embodiment of the present invention, the following data may be stored for 
each image. The stored data may then be used for the restoration of the image. 

1. A binary mask of the foreground layer (described in connection with Figs. 1-4).

2. Grey level foreground layer data compressed into a two bit or a one bit buffer
(described in relationship to Fig. 5).

3. Quantization grey levels for each tile: four levels for two bit compression, and two
levels for one bit compression (described in relationship to Fig. 5).

4. A compressed background layer (described in relationship to Figs. 1-4).

It is noted that each above point is described in detail hereinbelow. 

Reference is now made to Figs. 1, 2A and 2B, illustrations of a multi-layered 
image 10, a foreground 12 and a background 14, respectively. Typically image 10 is 
a grey scale image, such as a check, or any other document comprising a first layer 
and a relatively lighter second layer, such as a background. 

Image 10 herein may comprise foreground layer 12 and background layer 14.
Typically foreground 12 may comprise text and graphics. The text and graphics may 
be relatively darker than background 14. Alternatively, the text and graphics may be 
relatively lighter than background 14. 

Reference is now made in parallel to Fig. 3, a flow chart outlining an embodiment
of the present invention wherein image 10 is separated into foreground 12 and
background 14. It may be helpful to read the flow explanation below with reference to
Figs. 1-2B.

A document may be scanned (step 20), thereby creating digital grey scale image 
10. As is known in the art, image 10 may comprise pixels, wherein the pixel grey level 
range is 1 byte per pixel. Alternatively, the pixel level range is 4 bits, or any other 
appropriate grey scale level. 

The foreground 12 may be separated from the background 14. In the present
embodiment, a method may be used such that the text kernels are first identified (step
22). A kernel may be defined as comprising pixels that have common characteristics,
i.e., all the pixels that have a grey level over a threshold t are part of a kernel.

Methods for identifying kernels are described below in reference to
operators (1)-(4) and Fig. 4. Once the text kernels are identified, pixels in the vicinity of
the text kernels may also be checked. If appropriate, they too may be classified as
foreground 12.

It is noted that, as may happen from time to time, pixels may be mis-classified as
being text when they are not text, e.g. they are actually background 14, or vice versa.
When compiling the foreground layer 12, these mis-classified pixels will appear as
spikes. By morphology checks such pixels may be reset properly.

The identified kernels may then be compiled into a binary mask 11 of foreground
layer 12. The mask 11 of foreground layer 12 may then be completed by removing the
spikes from mis-classified pixels. In some embodiments, in order to create the mask
11, dividing image 10 into multiple tiles may precede step 22.
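
By way of non-limiting illustration, one very simple morphology check may be sketched in Python as follows. The description does not specify which morphology check is used; the rule below (flip a pixel when all of its 8 neighbours carry the opposite label) and the function name remove_spikes are assumptions made only for this sketch.

```python
def remove_spikes(mask):
    """Flip any pixel whose 8 neighbours all carry the opposite label.

    mask is a list of rows holding 0 (background) or 1 (foreground).
    Assumption: this isolated-pixel rule stands in for the unspecified
    morphology check.
    """
    h, w = len(mask), len(mask[0])
    out = [row[:] for row in mask]
    for y in range(h):
        for x in range(w):
            neighbours = [mask[ny][nx]
                          for ny in range(max(0, y - 1), min(h, y + 2))
                          for nx in range(max(0, x - 1), min(w, x + 2))
                          if (ny, nx) != (y, x)]
            if neighbours and all(n != mask[y][x] for n in neighbours):
                out[y][x] = 1 - mask[y][x]
    return out
```

In this sketch an isolated foreground pixel is reset to background and an isolated hole inside a stroke is reset to foreground, while pixels with at least one like-labelled neighbour are left unchanged.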

In alternative embodiments, separation of the foreground from the background
may be accomplished via standard binarization techniques. In some embodiments,
binarization may be performed using alternative text binarization methods such as that
described in US patent application 09/210327, Method of Binarization in an Optical
Character Recognition System, assigned to a common assignee and incorporated herein
by reference.

The mask 11 may then be compressed (step 24) by any standard method
(e.g. Group 4, JBIG, JBIG2) and stored for the restoration step. In a conservative mode,
in order to preserve foreground pixels for restoration, the mask 11 may be scaled down
by two in the X direction.

Image 10 may be divided (step 26) into multiple tiles 16, labeled herein as tiles
16A to 16N. The size of tiles 16 is arbitrary; however, it may typically be a size useful for
compression purposes. It is noted that each tile 16 may have different characteristics
and compression parameters. As an example, tile 16A may comprise more text, and
thus have a relatively darker average grey level than the tile 16 next to it, or the average
tile 16. Conversely, tile 16H may comprise more background, and thus have a
relatively lighter average grey level than the average tile 16.
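
As a minimal sketch of the tiling step, assuming the image is held as a list of rows of grey levels, the division into tiles may be written in Python as follows. The function name split_tiles and the choice of keying each tile by its top-left corner are hypothetical and serve only the illustration.

```python
def split_tiles(img, tile_h, tile_w):
    """Divide an image into tiles of at most tile_h x tile_w pixels.

    Returns a dict mapping the (row, column) of each tile's top-left
    corner to the tile itself. Tiles on the right and bottom edges may be
    smaller when the image size is not a multiple of the tile size.
    """
    tiles = {}
    for y0 in range(0, len(img), tile_h):
        for x0 in range(0, len(img[0]), tile_w):
            tiles[(y0, x0)] = [row[x0:x0 + tile_w]
                               for row in img[y0:y0 + tile_h]]
    return tiles
```

Keeping the corner coordinates with each tile allows per-tile parameters, such as quantization grey levels, to be stored and later matched to the correct region of the image.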

The grey level of the pixels in mask 11 may then be quantized (step 30),
compressed by a non-lossy method such as that described hereinbelow in relationship
to Fig. 5, and stored.

As a short mid-summary, it is noted that foreground 12 may comprise mask 11
(step 22) and the grey level values of the pixels of the mask 11 (step 30); this correlates to
items 1 and 2 from the list above.

By default, what is not the mask 11 is background 14 (step 32). When mask 11
is separated from image 10, holes may be left in the remaining image (background 14).
These holes may be filled (step 34) with pixels having an average grey value equal to
that of the background pixels in close vicinity to the holes.
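
The hole filling of step 34 may be sketched in Python as follows. The size of the "close vicinity" is not stated in the description; the square window of a given radius, its widening when no background pixel is found, and the function name fill_holes are assumptions of this sketch.

```python
def fill_holes(img, mask, radius=2):
    """Replace masked (foreground) pixels by the mean of nearby background pixels.

    img and mask are lists of rows; mask holds 1 where a hole is left.
    Assumption: "close vicinity" is a square window of the given radius.
    """
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            if not mask[y][x]:
                continue
            r = radius
            near = []
            # Widen the window until it contains at least one background pixel.
            while not near and r <= max(h, w):
                near = [img[ny][nx]
                        for ny in range(max(0, y - r), min(h, y + r + 1))
                        for nx in range(max(0, x - r), min(w, x + r + 1))
                        if not mask[ny][nx]]
                r += 1
            if near:
                out[y][x] = round(sum(near) / len(near))
    return out
```

Filling the holes with local background values avoids sharp dark regions in the background layer, which would otherwise be costly for a lossy compressor to represent.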

Background 14 may then be compressed and stored (step 36) using various
compression procedures, such as sub-sampling of the background layer. As an
example, the scale down factor may be predefined to produce a required restored
background quality. Alternatively, state of the art compression procedures such as
JPEG, JPEG2000, etc. may be used. The compression rate may be set to produce a
required restored quality. Yet further alternatively, the compression procedure may be
a combination of sub-sampling and compressing of the sub-sampled image by known
compression procedures. It is noted that background 14 may have little to no
contribution to the readability of the image textual content, and as such, relatively
highly lossy compression methods may be used.
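
As a non-limiting sketch of the sub-sampling option, the background may be scaled down by averaging square blocks of pixels. Block averaging is one possible sub-sampling rule chosen for this illustration; the description does not prescribe it, and the function name subsample is hypothetical.

```python
def subsample(img, factor):
    """Scale an image down by an integer factor using block averaging.

    Assumption: each output pixel is the rounded mean of a factor x factor
    block; blocks on the right and bottom edges may be smaller.
    """
    h, w = len(img), len(img[0])
    out = []
    for y0 in range(0, h, factor):
        row = []
        for x0 in range(0, w, factor):
            block = [img[y][x]
                     for y in range(y0, min(h, y0 + factor))
                     for x in range(x0, min(w, x0 + factor))]
            row.append(round(sum(block) / len(block)))
        out.append(row)
    return out
```

With a scale down factor of 2 in both directions, the stored background holds roughly one quarter of the original pixel count before any further compression is applied.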

The steps listed hereinabove may be performed for each tile 16, thus building 
the entire image 10. 

In conclusion of the above discussion, it is therefore noted that the above 
described method may provide the ability to separate an image into 2 layers, compress 
each layer with a different compression method, and store appropriate layer data for 
future use in restoration of the image. 

It is noted that separating the text from background figures is a difficult problem. 
Prior art separating solutions typically separated the image into layers of text, figures, 
background figures, background texture, etc. However, when text is written on a figured 
background, it is difficult for the prior art separation techniques to identify that part of 
the image that is text, and that part of the image that is the figured background. 

The inventors of the present invention discovered that it is simpler to identify the 
relatively important data, the text, using text binarization like methods, and not attempt 
to identify the background, or less important layer. Therefore, as described above, the 
present invention identifies the text layer, and by default, the remaining image is the 
background layer. 

It is additionally noted that, since it was so difficult for prior art methods to separate the
text from the background figures, the entire image is typically compressed in the prior art
with a single compression method. The inventors discovered that this procedure may
waste valuable storage space and decrease readability at high compression rates,
whereas, with the use of the present invention to separate important data (foreground)
from less important data (background), it may be possible to store each layer using an
appropriate compression method.

A method for identifying foreground 12 will now be explained. Typically
foreground 12 may comprise text having typical morphology features of strokes that are
relatively darker than background 14. The following operators reflect those
stroke features:

(1) P(x-w, y) - P(x,y) > t AND P(x+w, y) - P(x,y) > t; OR

(2) P(x, y-w) - P(x,y) > t AND P(x, y+w) - P(x,y) > t; OR

(3) P(x+d, y+d) - P(x,y) > t AND P(x-d, y-d) - P(x,y) > t; OR

(4) P(x-d, y+d) - P(x,y) > t AND P(x+d, y-d) - P(x,y) > t,

wherein P(x,y) is a pixel value at (x,y), w is a typical stroke width, d is w/2 and t is
a relative threshold (contrast). The stroke width w, and the relative threshold t, may be
predefined or may be estimated from the input image. A detailed description of this
method is included in US patent application 09/210327, Method of Binarization
in an Optical Character Recognition System, assigned to a common assignee and
incorporated herein by reference.

A pixel may be labelled as text if it is darker, by more than the relative threshold t, than
the background in a vicinity of a stroke width w. By applying one of the above operators to
the image pixels, most of the text pixels are correctly labelled as text. The text pixels
are identified as belonging to foreground layer 12.
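
By way of illustration only, operators (1)-(4) may be sketched in Python as follows, with the image held as a list of rows of grey levels in which a higher value is lighter. The function names, the clamping of coordinates at the image border, and the rounding of d = w/2 to at least one pixel are assumptions of this sketch and are not taken from the referenced application.

```python
def is_text_kernel(img, x, y, w, t):
    """Return True if pixel (x, y) satisfies any of operators (1)-(4).

    img[y][x] is the grey level P(x, y); w is the stroke width and t the
    relative threshold (contrast).
    """
    height, width = len(img), len(img[0])
    d = max(1, w // 2)  # d = w/2; assumption: rounded down, at least 1

    def p(px, py):
        # Assumption: coordinates outside the image are clamped to the border.
        px = min(max(px, 0), width - 1)
        py = min(max(py, 0), height - 1)
        return img[py][px]

    c = img[y][x]
    pairs = [
        ((x - w, y), (x + w, y)),          # operator (1): horizontal
        ((x, y - w), (x, y + w)),          # operator (2): vertical
        ((x + d, y + d), (x - d, y - d)),  # operator (3): diagonal
        ((x - d, y + d), (x + d, y - d)),  # operator (4): anti-diagonal
    ]
    return any(p(*a) - c > t and p(*b) - c > t for a, b in pairs)


def kernel_mask(img, w, t):
    """Apply the operators to every pixel and return a 0/1 mask."""
    return [[1 if is_text_kernel(img, x, y, w, t) else 0
             for x in range(len(img[0]))]
            for y in range(len(img))]
```

Each operator compares the pixel with two opposite neighbours, so a pixel is labelled only when it is darker than the surroundings on both sides of a stroke, which is what distinguishes a stroke from an edge of a large dark region.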

It is noted that similar operators can be used when the text is lighter than the 
background. 

However, there may be remaining text pixels that the above technique does not 
recognize as text. These yet unrecognised text pixels may be labelled as text in the 
following procedure. To further understanding, it may be helpful to read this 
explanation together with Fig. 4, an illustration of an exemplary tile 16. In each tile 16, a 
grey level threshold 17 is defined from the grey levels of labelled text pixels 18. The 
pixels 18 in close vicinity to the text kernels, with grey level below threshold 17 are now 
labelled as text. 
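
The expansion of the text kernels may be sketched in Python as follows. How threshold 17 is derived from the labelled text pixels is not detailed above; taking it as the lightest labelled text pixel of the tile, and treating "close vicinity" as a square window of a given radius, are assumptions of this sketch, as is the function name expand_kernels.

```python
def expand_kernels(tile, mask, radius):
    """Label pixels near existing text kernels as text if they are dark enough.

    tile and mask are lists of rows; mask holds 1 for labelled text pixels.
    Returns a new mask and leaves the input unchanged.
    """
    kernel_levels = [tile[y][x]
                     for y, row in enumerate(mask)
                     for x, m in enumerate(row) if m]
    if not kernel_levels:
        return [row[:] for row in mask]
    # Assumption: the threshold is the lightest labelled text pixel in the tile.
    threshold = max(kernel_levels)
    h, w = len(tile), len(tile[0])
    out = [row[:] for row in mask]
    for y in range(h):
        for x in range(w):
            if mask[y][x] or tile[y][x] >= threshold:
                continue
            # Assumption: "close vicinity" is a square window of the given radius.
            near = any(mask[ny][nx]
                       for ny in range(max(0, y - radius), min(h, y + radius + 1))
                       for nx in range(max(0, x - radius), min(w, x + radius + 1)))
            if near:
                out[y][x] = 1
    return out
```

Because the vicinity test reads the original mask rather than the growing one, a single call expands the kernels by at most the given radius.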

A method for compressing foreground 12 will now be described. 

Normally, in grey scale images, the pixel grey level dynamic range is [0, 255],
i.e. one byte per pixel. However, the local dynamic range of text pixels within a small
area may be much smaller. Therefore, if the pixel level is quantized, pixels may be 
stored at less than one byte per pixel. The present invention describes alternative 
compression schemes, a two bit per pixel compression scheme and a one bit per pixel 
compression scheme. 

Two bit per pixel compression: four level quantization

For each tile 16, a grey level histogram of pixels 18 in foreground 12 and
background 14 is calculated. To aid this discussion, reference is now made in parallel
to Fig. 5, an exemplary grey level histogram 40 of the pixels 18 in foreground 12 of
exemplary tile 16A, constructed according to an embodiment of the present invention.
Four dominant levels 42, 44, 46 and 48 are deduced from histogram 40. The levels
42-48 are assigned values: the lowest dominant level, level 42, is 0, e.g. [00]; level 44 is 1,
e.g. [01]; level 46 is 2, e.g. [10]; and level 48 is 3, e.g. [11].
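
The description does not state how the dominant levels are deduced from the histogram. Purely as an illustration, and as a swapped-in technique rather than the method of the invention, the levels may be estimated with a one-dimensional Lloyd (k-means) iteration, sketched in Python below. The function name dominant_levels, the quantile initialisation and the fixed iteration count are assumptions.

```python
def dominant_levels(pixels, k=4, iterations=20):
    """Estimate k dominant grey levels with a 1-D Lloyd (k-means) iteration.

    pixels is a flat list of grey levels from one tile. Assumption: this
    clustering stands in for the unspecified histogram analysis.
    """
    values = sorted(pixels)
    # Start from evenly spaced quantiles of the sorted grey levels.
    levels = [values[(2 * i + 1) * len(values) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - levels[i]))
            groups[nearest].append(v)
        # Keep the previous level when a group ends up empty.
        levels = [round(sum(g) / len(g)) if g else levels[i]
                  for i, g in enumerate(groups)]
    return sorted(levels)
```

The returned list is sorted from darkest to lightest, so its indices 0 to 3 correspond to the two-bit values assigned to levels 42, 44, 46 and 48.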

Using the nearest neighbour scheme, or alternative methods, pixels 18 may be
mapped to one of the four dominant levels, as appropriate. As an example, pixels
having a level of 190 may be mapped to dominant level 46 and given a value of 2.

The pixels 18 in exemplary tile 16 may then be stored in a buffer, 2 bits per pixel,
thereby producing a compression ratio of 1:4. If higher compression rates are
required, the buffer may be further compressed by any non-lossy method.

Additionally, the four dominant grey levels for each tile 16 may also be stored. As
such, for exemplary tile 16A, levels 96, 148, 210 and 240 may be stored representing
dominant levels 42-48, respectively. Upon restoration, by referring to the stored levels,
it is possible to restore pixels 18 to their original dominant grey levels. It is noted that
the histogram for each tile 16 may be different, and hence, the dominant levels 42-48
may vary from tile to tile. The present invention provides a method and apparatus to store,
and restore, each pixel for each tile with the appropriate grey level.
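
The nearest neighbour mapping, the two bit buffer and the restoration from the stored levels may be sketched in Python as follows. The function names and the bit order within each byte (first pixel in the two highest bits) are assumptions of this sketch; the description only requires two bits per pixel.

```python
def quantize(pixels, levels):
    """Map each grey level to the index of the nearest dominant level."""
    return [min(range(len(levels)), key=lambda i: abs(p - levels[i]))
            for p in pixels]


def pack_2bit(codes):
    """Pack 2-bit codes four per byte (assumption: first code in the high bits)."""
    out = bytearray((len(codes) + 3) // 4)
    for n, code in enumerate(codes):
        out[n // 4] |= code << (6 - 2 * (n % 4))
    return bytes(out)


def unpack_2bit(buf, count):
    """Recover the first `count` 2-bit codes from a packed buffer."""
    return [(buf[n // 4] >> (6 - 2 * (n % 4))) & 3 for n in range(count)]


def restore(codes, levels):
    """Replace each code by its stored dominant grey level."""
    return [levels[c] for c in codes]
```

With the exemplary stored levels 96, 148, 210 and 240, a pixel of level 190 lies 42 levels from 148 and 20 levels from 210, so quantize assigns it index 2, and restore later returns 210 for that pixel.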

One bit per pixel compression: two level quantization

Alternatively, it may be desirable to store each pixel in one bit. The above 2 bit
per pixel compression technique may be adapted to a two level quantization scheme.
Thus, instead of 4 dominant levels with a 0 [00] to 3 [11] range, the present scheme may
have 2 dominant levels with a 0 [0] to 1 [1] range. The mapping may be accomplished as
follows:

Level 42 (e.g. 0) is mapped to 0 and level 48 (e.g. 3) is mapped to 1. Levels 44
and 46 (e.g. 1 and 2, respectively) may be mapped to 0 if pixel 18 is darker than its
close neighbours. A pixel 18 is darker if it satisfies one of the operators noted above,
where P(x, y) is the two bit level [0-3], t is 0 and d is predefined and set close to a
typical stroke width in the image. If pixel 18 does not satisfy the operator, it is mapped
to 1.

The pixels 18 in the exemplary tile 16 may then be stored in a buffer, one bit per 
pixel. If higher compression rates are required, the buffers may be further compressed 
by any non-lossy method. Similar to the method above, for each tile 16, two actual grey 
scale levels are estimated from the histograms, and are stored for the restoration step. 
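
The two level mapping may be sketched in Python as follows, operating on a tile of two bit values (0 to 3) held as a list of rows. The function name to_one_bit, the passing of w and d as separate parameters, and the clamping of coordinates at the tile border are assumptions of this sketch.

```python
def to_one_bit(codes, w, d):
    """Reduce 2-bit codes (0..3) to 1-bit codes using the stroke operators with t = 0."""
    height, width = len(codes), len(codes[0])

    def p(x, y):
        # Assumption: coordinates outside the tile are clamped to the border.
        return codes[min(max(y, 0), height - 1)][min(max(x, 0), width - 1)]

    def darker(x, y):
        c = codes[y][x]
        pairs = [
            ((x - w, y), (x + w, y)),
            ((x, y - w), (x, y + w)),
            ((x + d, y + d), (x - d, y - d)),
            ((x - d, y + d), (x + d, y - d)),
        ]
        return any(p(*a) - c > 0 and p(*b) - c > 0 for a, b in pairs)

    out = []
    for y in range(height):
        row = []
        for x in range(width):
            c = codes[y][x]
            if c == 0:
                row.append(0)   # darkest level always maps to 0
            elif c == 3:
                row.append(1)   # lightest level always maps to 1
            else:
                row.append(0 if darker(x, y) else 1)
        out.append(row)
    return out
```

Only the two intermediate levels depend on their neighbourhood: an intermediate pixel is kept dark when it is darker than the pixels on both sides in at least one direction, and is otherwise mapped to the light level.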

Reference is now made to Fig. 6, a restored image 50 of image 10. Restored
image 50 may comprise a restored foreground layer 52 and a restored background 
layer 54. To reiterate information from the above, the stored data of image 10 may be: 

1. A binary mask of the foreground layer 

2. Grey level foreground layer data compressed into a two bit or a one bit buffer 

3. Quantization grey levels for each tile: four levels for two bit compression and
two levels for one bit compression.

4. A compressed background layer. 

Foreground 52 is restored by inverse mapping of the stored values of items 1-3.
The result may be a grey scale image of the foreground pixels only.

The restoration of the background 54 from item 4 depends on the chosen
compression procedure. If one of the conventional compression methods was used,
then the decompression is straightforward. If sub-sampling was used, then the scale up
may be done by any interpolation method. Bilinear interpolation may give good results.
Image 50 may then be restored by pasting the restored foreground layer 52 over the
restored background layer 54. Finally, it may be useful to apply a post-filter in order to
smooth the difference between the foreground and background segments.
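
As a non-limiting sketch of this restoration, the sub-sampled background may be scaled up by bilinear interpolation and the restored foreground pasted over it wherever the mask is set. The function names and the corner-aligned sampling convention (the first and last source pixels map to the first and last output pixels) are assumptions; the optional post-filter is not shown.

```python
def upscale_bilinear(small, height, width):
    """Scale a sub-sampled image up to height x width by bilinear interpolation.

    Assumption: corner-aligned sampling is used.
    """
    sh, sw = len(small), len(small[0])
    out = []
    for y in range(height):
        fy = y * (sh - 1) / (height - 1) if height > 1 else 0.0
        y0 = int(fy)
        y1 = min(y0 + 1, sh - 1)
        ty = fy - y0
        row = []
        for x in range(width):
            fx = x * (sw - 1) / (width - 1) if width > 1 else 0.0
            x0 = int(fx)
            x1 = min(x0 + 1, sw - 1)
            tx = fx - x0
            top = small[y0][x0] * (1 - tx) + small[y0][x1] * tx
            bottom = small[y1][x0] * (1 - tx) + small[y1][x1] * tx
            row.append(round(top * (1 - ty) + bottom * ty))
        out.append(row)
    return out


def paste_foreground(background, mask, foreground):
    """Take the foreground grey level where the mask is set, else the background."""
    return [[foreground[y][x] if mask[y][x] else background[y][x]
             for x in range(len(mask[0]))]
            for y in range(len(mask))]
```

Because the mask is stored at full resolution, the stroke shapes keep their sharp outline even though the background beneath them has been interpolated from a much coarser image.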

It is thus noted that the present invention may provide the use of two different
compression paths, for the foreground and the background, enabling high
compression ratios without readability penalties.

Additionally, the present invention may provide the quantization of the 
foreground levels by tiles, providing a more realistic dynamic range, which may produce 
a restoration closer to the original image. The textual parts in the restored image may 
be more realistic. 

It is noted that in cases where the background is negligible, a synthetic
background can be used. In such an instance, there may be no need to compress and
to store the background, and so, a higher compression rate may be achieved. It is
additionally noted that in images where the contrast is almost constant, the foreground
layer may be found by any binarization tool.

In alternative embodiments, the space division may be done by connected 
components rather than by tiles. The foreground layer may be quantized and stored by 
connected components. In the restoration phase, the connected components may be 
recreated from the foreground mask image. 

Reference is now made to Fig. 7, a block diagram of an image separation and 
storage system 60. System 60 may comprise a scanner 62, a processor 64 and 
memory 66. Scanner 62 may be used to create a binary image 63 of the image to be 
separated and stored. The binary image 63 may then be transmitted to processor 64,
which may separate the binary image into a first layer 65 and a second layer 67. Each
layer may then be compressed as described in detail hereinabove.

First layer 65 may therefore comprise, as noted above, a binary mask of the 
foreground layer, grey level foreground layer data compressed into a two bit or a one bit 
buffer, quantization grey levels for each tile (four levels for two bit compression and two 
levels for one bit compression). Second layer 67 may comprise, as noted above, a 
compressed background layer. Layers 65 and 67 may then be transferred to memory 
66 for storage. 

Processor 64 may further create a restored image from layers 65 and 67.
Additionally, processor 64 may comprise user interface devices, such as a keyboard,
screen, etc. (not shown). The screen may be used to display any of the images:
binary image 63, layers 65 and 67, or the restored image. Processor 64 may be used
to monitor or control the processes, such as to request a view of a specific image.

It will be appreciated by persons skilled in the art that the present invention is not
limited by what has been particularly shown and described hereinabove. Rather, the
scope of the invention may be defined by the claims that follow: